Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 4366 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 614.1 KiB |
| Average record size in memory | 144.0 B |
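The two memory figures above can be reproduced with simple arithmetic. The sketch below assumes every one of the 18 numeric columns is stored as a 64-bit value and that the default index adds 128 bytes; both are assumptions, since the report does not list dtypes or index overhead.

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 4366
n_cols = 18

BYTES_PER_VALUE = 8  # assumption: every column is a 64-bit numeric dtype
INDEX_BYTES = 128    # assumption: overhead of a default pandas RangeIndex

total_bytes = n_rows * n_cols * BYTES_PER_VALUE + INDEX_BYTES
total_kib = total_bytes / 1024
avg_record_bytes = total_bytes / n_rows

print(f"{total_kib:.1f} KiB")        # total size in memory
print(f"{avg_record_bytes:.1f} B")   # average record size in memory
```

Under these assumptions the results round to the 614.1 KiB and 144.0 B shown in the table.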
Variable types
| Numeric | 18 |
|---|---|
purchases is highly correlated with quantity_p and 4 other fields | High correlation |
devolutions is highly correlated with recency_d and 4 other fields | High correlation |
recency_p is highly correlated with invoices_p and 1 other field | High correlation |
recency_d is highly correlated with devolutions and 4 other fields | High correlation |
quantity_p is highly correlated with purchases and 4 other fields | High correlation |
quantity_d is highly correlated with devolutions and 4 other fields | High correlation |
invoices_p is highly correlated with purchases and 5 other fields | High correlation |
invoices_d is highly correlated with devolutions and 4 other fields | High correlation |
avg_ticket is highly correlated with avg_variety | High correlation |
avg_recency_days is highly correlated with purchases and 4 other fields | High correlation |
avg_basket_size is highly correlated with purchases and 2 other fields | High correlation |
avg_variety is highly correlated with avg_ticket | High correlation |
purchases_pday is highly correlated with invoices_p | High correlation |
gross_revenue is highly correlated with purchases and 4 other fields | High correlation |
relative_revenue is highly correlated with devolutions and 4 other fields | High correlation |
relative_quantity is highly correlated with devolutions and 4 other fields | High correlation |
purchases is highly correlated with quantity_p and 2 other fields | High correlation |
devolutions is highly correlated with quantity_p and 3 other fields | High correlation |
recency_p is highly correlated with avg_recency_days | High correlation |
recency_d is highly correlated with invoices_d | High correlation |
quantity_p is highly correlated with purchases and 4 other fields | High correlation |
quantity_d is highly correlated with devolutions and 3 other fields | High correlation |
invoices_p is highly correlated with purchases and 2 other fields | High correlation |
invoices_d is highly correlated with recency_d and 1 other field | High correlation |
avg_ticket is highly correlated with devolutions and 3 other fields | High correlation |
avg_recency_days is highly correlated with recency_p and 1 other field | High correlation |
avg_basket_size is highly correlated with devolutions and 3 other fields | High correlation |
purchases_pday is highly correlated with avg_recency_days | High correlation |
gross_revenue is highly correlated with purchases and 1 other field | High correlation |
relative_revenue is highly correlated with relative_quantity | High correlation |
relative_quantity is highly correlated with relative_revenue | High correlation |
purchases is highly correlated with quantity_p and 2 other fields | High correlation |
devolutions is highly correlated with recency_d and 4 other fields | High correlation |
recency_p is highly correlated with avg_recency_days | High correlation |
recency_d is highly correlated with devolutions and 4 other fields | High correlation |
quantity_p is highly correlated with purchases and 2 other fields | High correlation |
quantity_d is highly correlated with devolutions and 4 other fields | High correlation |
invoices_p is highly correlated with purchases and 2 other fields | High correlation |
invoices_d is highly correlated with devolutions and 4 other fields | High correlation |
avg_recency_days is highly correlated with recency_p and 1 other field | High correlation |
avg_basket_size is highly correlated with quantity_p | High correlation |
gross_revenue is highly correlated with purchases and 2 other fields | High correlation |
relative_revenue is highly correlated with devolutions and 4 other fields | High correlation |
relative_quantity is highly correlated with devolutions and 4 other fields | High correlation |
df_index is highly correlated with recency_p and 1 other field | High correlation |
purchases is highly correlated with devolutions and 7 other fields | High correlation |
devolutions is highly correlated with purchases and 4 other fields | High correlation |
recency_p is highly correlated with df_index and 1 other field | High correlation |
quantity_p is highly correlated with purchases and 5 other fields | High correlation |
quantity_d is highly correlated with purchases and 4 other fields | High correlation |
invoices_p is highly correlated with purchases and 2 other fields | High correlation |
invoices_d is highly correlated with purchases and 2 other fields | High correlation |
avg_ticket is highly correlated with purchases and 4 other fields | High correlation |
avg_recency_days is highly correlated with df_index and 1 other field | High correlation |
avg_basket_size is highly correlated with purchases and 4 other fields | High correlation |
gross_revenue is highly correlated with purchases and 3 other fields | High correlation |
relative_revenue is highly correlated with relative_quantity | High correlation |
relative_quantity is highly correlated with relative_revenue | High correlation |
devolutions is highly skewed (γ1 = 47.38221518) | Skewed |
quantity_p is highly skewed (γ1 = 30.93123636) | Skewed |
quantity_d is highly skewed (γ1 = 45.51559535) | Skewed |
avg_ticket is highly skewed (γ1 = 46.64636797) | Skewed |
avg_basket_size is highly skewed (γ1 = 48.13011902) | Skewed |
gross_revenue is highly skewed (γ1 = 21.69086914) | Skewed |
df_index has unique values | Unique |
customer_id has unique values | Unique |
devolutions has 2778 (63.6%) zeros | Zeros |
quantity_d has 2778 (63.6%) zeros | Zeros |
invoices_d has 2778 (63.6%) zeros | Zeros |
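The alert list above names the same variables several times under "High correlation", which is consistent with the report evaluating more than one correlation measure. The report text does not name the measures or the cut-off, so the sketch below is only an illustration: it compares a linear (Pearson) and a rank-based (Spearman) coefficient on toy data, with a hypothetical threshold of 0.9.

```python
from statistics import mean

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5

def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = shared_rank
        i = j + 1
    return result

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

# Toy data, not taken from the profiled dataset: y grows monotonically
# but far from linearly with x because of one extreme value.
x = [1, 2, 3, 4, 5, 6]
y = [1, 4, 9, 16, 25, 1000]

THRESHOLD = 0.9  # hypothetical cut-off, not read from the report
for name, coefficient in (("pearson", pearson(x, y)), ("spearman", spearman(x, y))):
    print(name, round(coefficient, 3), "flagged" if coefficient >= THRESHOLD else "not flagged")
```

On this toy data the rank-based coefficient crosses the threshold while the linear one does not, which is one way a pair of variables can appear under one measure and not another.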
Reproduction
| Analysis started | 2022-02-04 18:20:05.289728 |
|---|---|
| Analysis finished | 2022-02-04 18:20:44.647390 |
| Duration | 39.36 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
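The Duration row follows directly from the two timestamps. A minimal check, using only the values printed in the table:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the "Reproduction" table.
started = datetime.strptime("2022-02-04 18:20:05.289728", FMT)
finished = datetime.strptime("2022-02-04 18:20:44.647390", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```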
df_index
| Distinct | 4366 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2837.469079 |
| Minimum | 0 |
|---|---|
| Maximum | 5970 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 229.25 |
| Q1 | 1303.25 |
| median | 2733.5 |
| Q3 | 4423.75 |
| 95-th percentile | 5635.75 |
| Maximum | 5970 |
| Range | 5970 |
| Interquartile range (IQR) | 3120.5 |
Descriptive statistics
| Standard deviation | 1758.395649 |
|---|---|
| Coefficient of variation (CV) | 0.6197056603 |
| Kurtosis | -1.238510832 |
| Mean | 2837.469079 |
| Median Absolute Deviation (MAD) | 1550 |
| Skewness | 0.1035389668 |
| Sum | 12388390 |
| Variance | 3091955.26 |
| Monotonicity | Strictly increasing |
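Several rows in the quantile and descriptive statistics are derived from other rows. The sketch below recomputes them from the figures printed above; the small tolerances in any comparison would only account for the rounding of the displayed values.

```python
# Values copied from the quantile and descriptive statistics above.
minimum, maximum = 0, 5970
q1, q3 = 1303.25, 4423.75
mean, std = 2837.469079, 1758.395649

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance

print(value_range, iqr)
print(f"{cv:.4f}", f"{variance:.0f}")
```

The same relationships hold for every variable block in the report, so they are a quick way to sanity-check a table that looks damaged.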
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 3799 | 1 | < 0.1% |
| 3805 | 1 | < 0.1% |
| 3804 | 1 | < 0.1% |
| 3803 | 1 | < 0.1% |
| 3802 | 1 | < 0.1% |
| 3801 | 1 | < 0.1% |
| 3800 | 1 | < 0.1% |
| 3794 | 1 | < 0.1% |
| 3966 | 1 | < 0.1% |
| Other values (4356) | 4356 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 5970 | 1 | |
| 5963 | 1 | |
| 5962 | 1 | |
| 5960 | 1 | |
| 5958 | 1 | |
| 5954 | 1 | |
| 5953 | 1 | |
| 5952 | 1 | |
| 5951 | 1 | |
| 5950 | 1 |
customer_id
| Distinct | 4366 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15301.45121 |
| Minimum | 12346 |
|---|---|
| Maximum | 18287 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 12346 |
|---|---|
| 5-th percentile | 12614.25 |
| Q1 | 13814.25 |
| median | 15303.5 |
| Q3 | 16779.75 |
| 95-th percentile | 17984.75 |
| Maximum | 18287 |
| Range | 5941 |
| Interquartile range (IQR) | 2965.5 |
Descriptive statistics
| Standard deviation | 1722.144646 |
|---|---|
| Coefficient of variation (CV) | 0.1125477984 |
| Kurtosis | -1.195850103 |
| Mean | 15301.45121 |
| Median Absolute Deviation (MAD) | 1484.5 |
| Skewness | 0.0001007032311 |
| Sum | 66806136 |
| Variance | 2965782.181 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17850 | 1 | < 0.1% |
| 15909 | 1 | < 0.1% |
| 12450 | 1 | < 0.1% |
| 15334 | 1 | < 0.1% |
| 17562 | 1 | < 0.1% |
| 17879 | 1 | < 0.1% |
| 16050 | 1 | < 0.1% |
| 13618 | 1 | < 0.1% |
| 16869 | 1 | < 0.1% |
| 13730 | 1 | < 0.1% |
| Other values (4356) | 4356 |
| Value | Count | Frequency (%) |
| 12346 | 1 | |
| 12347 | 1 | |
| 12348 | 1 | |
| 12349 | 1 | |
| 12350 | 1 | |
| 12352 | 1 | |
| 12353 | 1 | |
| 12354 | 1 | |
| 12355 | 1 | |
| 12356 | 1 |
| Value | Count | Frequency (%) |
| 18287 | 1 | |
| 18283 | 1 | |
| 18282 | 1 | |
| 18281 | 1 | |
| 18280 | 1 | |
| 18278 | 1 | |
| 18277 | 1 | |
| 18276 | 1 | |
| 18274 | 1 | |
| 18273 | 1 |
purchases
| Distinct | 4249 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2040.183367 |
| Minimum | 0 |
|---|---|
| Maximum | 280206.02 |
| Zeros | 33 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 107.4875 |
| Q1 | 303.3075 |
| median | 665.82 |
| Q3 | 1654.0875 |
| 95-th percentile | 5775.52 |
| Maximum | 280206.02 |
| Range | 280206.02 |
| Interquartile range (IQR) | 1350.78 |
Descriptive statistics
| Standard deviation | 8962.013605 |
|---|---|
| Coefficient of variation (CV) | 4.392749078 |
| Kurtosis | 480.9126511 |
| Mean | 2040.183367 |
| Median Absolute Deviation (MAD) | 465.995 |
| Skewness | 19.38102887 |
| Sum | 8907440.58 |
| Variance | 80317687.85 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 33 | 0.8% |
| 76.32 | 4 | 0.1% |
| 440 | 3 | 0.1% |
| 35.4 | 3 | 0.1% |
| 15 | 3 | 0.1% |
| 363.65 | 3 | 0.1% |
| 85 | 2 | < 0.1% |
| 122.7 | 2 | < 0.1% |
| 110.38 | 2 | < 0.1% |
| 238.85 | 2 | < 0.1% |
| Other values (4239) | 4309 |
| Value | Count | Frequency (%) |
| 0 | 33 | |
| 3.75 | 1 | < 0.1% |
| 6.2 | 1 | < 0.1% |
| 6.9 | 1 | < 0.1% |
| 12.75 | 1 | < 0.1% |
| 13.3 | 1 | < 0.1% |
| 15 | 3 | 0.1% |
| 17 | 1 | < 0.1% |
| 20.8 | 2 | < 0.1% |
| 25.5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 280206.02 | 1 | |
| 259657.3 | 1 | |
| 194550.79 | 1 | |
| 168472.5 | 1 | |
| 143825.06 | 1 | |
| 124914.53 | 1 | |
| 117379.63 | 1 | |
| 91062.38 | 1 | |
| 81024.84 | 1 | |
| 77183.6 | 1 |
devolutions
Real number (ℝ≥0)
HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, SKEWED, ZEROS
| Distinct | 1156 |
|---|---|
| Distinct (%) | 26.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 140.019787 |
| Minimum | 0 |
|---|---|
| Maximum | 168469.6 |
| Zeros | 2778 |
| Zeros (%) | 63.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
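The zero share in this table is the zero count divided by the number of observations reported in the dataset statistics. A minimal check:

```python
n_observations = 4366  # from "Number of observations"
zeros = 2778           # from the "Zeros" row above

zero_share = zeros / n_observations
non_zero = n_observations - zeros

print(f"{zero_share:.1%}")  # share of observations equal to zero
print(non_zero)             # observations with a non-zero value
```

The same count of 2778 zeros is reported for quantity_d and invoices_d, which suggests (without the report confirming it) that these are the same observations in all three variables.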
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 16.575 |
| 95-th percentile | 191.135 |
| Maximum | 168469.6 |
| Range | 168469.6 |
| Interquartile range (IQR) | 16.575 |
Descriptive statistics
| Standard deviation | 2954.51808 |
|---|---|
| Coefficient of variation (CV) | 21.10071829 |
| Kurtosis | 2530.699539 |
| Mean | 140.019787 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 47.38221518 |
| Sum | 611326.39 |
| Variance | 8729177.088 |
| Monotonicity | Not monotonic |
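The skewness row is what drives the "Skewed" alert. As a rough illustration, the sketch below computes the adjusted Fisher-Pearson sample skewness, which is assumed here to be the estimator behind the report's γ1 values. The cut-off of 20 is an inference from this report (every flagged variable exceeds 20, and a skewness of 19.38 elsewhere in the report is not flagged), not a value stated in it.

```python
def sample_skewness(values):
    """Adjusted Fisher-Pearson skewness (bias-corrected for sample size)."""
    n = len(values)
    m = sum(values) / n
    m2 = sum((v - m) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - m) ** 3 for v in values) / n  # third central moment
    g1 = m3 / m2 ** 1.5
    return (n * (n - 1)) ** 0.5 / (n - 2) * g1

SKEW_THRESHOLD = 20  # assumption inferred from the alerts, not documented here

# Toy data, not taken from the profiled dataset.
mostly_zero = [0] * 999 + [1000]  # one extreme value in a sea of zeros
symmetric = list(range(-5, 6))    # perfectly symmetric around 0

for name, data in (("mostly_zero", mostly_zero), ("symmetric", symmetric)):
    skew = sample_skewness(data)
    print(name, round(skew, 2), "skewed" if abs(skew) > SKEW_THRESHOLD else "ok")
```

A column dominated by zeros with a few very large values, like the toy `mostly_zero` list, is the typical shape that produces skewness values of this magnitude.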
| Value | Count | Frequency (%) |
| 0 | 2778 | 63.6% |
| 12.75 | 21 | 0.5% |
| 4.95 | 18 | 0.4% |
| 15 | 15 | 0.3% |
| 9.95 | 15 | 0.3% |
| 5.9 | 12 | 0.3% |
| 25.5 | 11 | 0.3% |
| 4.25 | 10 | 0.2% |
| 3.75 | 9 | 0.2% |
| 19.9 | 9 | 0.2% |
| Other values (1146) | 1468 |
| Value | Count | Frequency (%) |
| 0 | 2778 | 63.6% |
| 0.42 | 2 | < 0.1% |
| 0.65 | 1 | < 0.1% |
| 0.77 | 1 | < 0.1% |
| 0.95 | 1 | < 0.1% |
| 1.25 | 5 | 0.1% |
| 1.45 | 4 | 0.1% |
| 1.64 | 1 | < 0.1% |
| 1.65 | 5 | 0.1% |
| 1.7 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 168469.6 | 1 | |
| 77183.6 | 1 | |
| 39267 | 1 | |
| 30032.23 | 1 | |
| 22998.4 | 1 | |
| 12158.9 | 1 | |
| 11252.44 | 1 | |
| 8593.15 | 1 | |
| 8495.01 | 1 | |
| 8043.88 | 1 |
recency_p
| Distinct | 304 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.07558406 |
| Minimum | 0 |
|---|---|
| Maximum | 373 |
| Zeros | 35 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 17 |
| median | 51 |
| Q3 | 147 |
| 95-th percentile | 318 |
| Maximum | 373 |
| Range | 373 |
| Interquartile range (IQR) | 130 |
Descriptive statistics
| Standard deviation | 102.4445304 |
|---|---|
| Coefficient of variation (CV) | 1.088959812 |
| Kurtosis | 0.3775336914 |
| Mean | 94.07558406 |
| Median Absolute Deviation (MAD) | 41 |
| Skewness | 1.235556949 |
| Sum | 410734 |
| Variance | 10494.8818 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 103 | 2.4% |
| 3 | 94 | 2.2% |
| 4 | 94 | 2.2% |
| 2 | 90 | 2.1% |
| 8 | 79 | 1.8% |
| 10 | 77 | 1.8% |
| 17 | 74 | 1.7% |
| 7 | 72 | 1.6% |
| 9 | 71 | 1.6% |
| 22 | 64 | 1.5% |
| Other values (294) | 3548 |
| Value | Count | Frequency (%) |
| 0 | 35 | 0.8% |
| 1 | 103 | |
| 2 | 90 | |
| 3 | 94 | |
| 4 | 94 | |
| 5 | 48 | |
| 7 | 72 | |
| 8 | 79 | |
| 9 | 71 | |
| 10 | 77 |
| Value | Count | Frequency (%) |
| 373 | 17 | 0.4% |
| 372 | 18 | |
| 371 | 6 | 0.1% |
| 369 | 3 | 0.1% |
| 368 | 5 | 0.1% |
| 367 | 5 | 0.1% |
| 366 | 10 | 0.2% |
| 365 | 43 | |
| 364 | 6 | 0.1% |
| 362 | 6 | 0.1% |
recency_d
| Distinct | 278 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 280.8964727 |
| Minimum | 0 |
|---|---|
| Maximum | 373 |
| Zeros | 5 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 185 |
| median | 365 |
| Q3 | 365 |
| 95-th percentile | 365 |
| Maximum | 373 |
| Range | 373 |
| Interquartile range (IQR) | 180 |
Descriptive statistics
| Standard deviation | 129.9871958 |
|---|---|
| Coefficient of variation (CV) | 0.4627583768 |
| Kurtosis | -0.4666517161 |
| Mean | 280.8964727 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -1.118539076 |
| Sum | 1226394 |
| Variance | 16896.67107 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 365 | 2792 | |
| 8 | 44 | 1.0% |
| 64 | 39 | 0.9% |
| 35 | 28 | 0.6% |
| 3 | 28 | 0.6% |
| 9 | 26 | 0.6% |
| 21 | 26 | 0.6% |
| 25 | 22 | 0.5% |
| 29 | 22 | 0.5% |
| 31 | 20 | 0.5% |
| Other values (268) | 1319 |
| Value | Count | Frequency (%) |
| 0 | 5 | 0.1% |
| 1 | 20 | |
| 2 | 13 | 0.3% |
| 3 | 28 | |
| 4 | 13 | 0.3% |
| 5 | 4 | 0.1% |
| 7 | 10 | 0.2% |
| 8 | 44 | |
| 9 | 26 | |
| 10 | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 373 | 1 | < 0.1% |
| 372 | 8 | 0.2% |
| 371 | 2 | < 0.1% |
| 369 | 2 | < 0.1% |
| 368 | 8 | 0.2% |
| 367 | 1 | < 0.1% |
| 366 | 7 | 0.2% |
| 365 | 2792 | |
| 364 | 1 | < 0.1% |
| 362 | 2 | < 0.1% |
quantity_p
Real number (ℝ≥0)
HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, SKEWED
| Distinct | 777 |
|---|---|
| Distinct (%) | 17.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 311.8268438 |
| Minimum | 0 |
|---|---|
| Maximum | 80996 |
| Zeros | 33 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 60 |
| median | 115 |
| Q3 | 234 |
| 95-th percentile | 710 |
| Maximum | 80996 |
| Range | 80996 |
| Interquartile range (IQR) | 174 |
Descriptive statistics
| Standard deviation | 1965.923119 |
|---|---|
| Coefficient of variation (CV) | 6.304534579 |
| Kurtosis | 1148.530998 |
| Mean | 311.8268438 |
| Median Absolute Deviation (MAD) | 69 |
| Skewness | 30.93123636 |
| Sum | 1361436 |
| Variance | 3864853.711 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 28 | 42 | 1.0% |
| 60 | 40 | 0.9% |
| 67 | 39 | 0.9% |
| 51 | 37 | 0.8% |
| 72 | 36 | 0.8% |
| 52 | 35 | 0.8% |
| 66 | 34 | 0.8% |
| 70 | 33 | 0.8% |
| 0 | 33 | 0.8% |
| 36 | 32 | 0.7% |
| Other values (767) | 4005 |
| Value | Count | Frequency (%) |
| 0 | 33 | |
| 1 | 11 | 0.3% |
| 2 | 5 | 0.1% |
| 3 | 14 | |
| 4 | 6 | 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 14 | |
| 7 | 3 | 0.1% |
| 8 | 5 | 0.1% |
| 9 | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 80996 | 1 | |
| 74215 | 1 | |
| 38639 | 1 | |
| 21352 | 1 | |
| 17376 | 1 | |
| 17150 | 1 | |
| 16288 | 1 | |
| 15853 | 1 | |
| 13369 | 1 | |
| 12872 | 1 |
quantity_d
Real number (ℝ≥0)
HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, SKEWED, ZEROS
| Distinct | 186 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55.71071919 |
| Minimum | 0 |
|---|---|
| Maximum | 80995 |
| Zeros | 2778 |
| Zeros (%) | 63.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 3 |
| 95-th percentile | 42 |
| Maximum | 80995 |
| Range | 80995 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1678.948683 |
|---|---|
| Coefficient of variation (CV) | 30.13690556 |
| Kurtosis | 2109.732802 |
| Mean | 55.71071919 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 45.51559535 |
| Sum | 243233 |
| Variance | 2818868.681 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2778 | 63.6% |
| 1 | 349 | 8.0% |
| 3 | 173 | 4.0% |
| 6 | 90 | 2.1% |
| 2 | 89 | 2.0% |
| 4 | 75 | 1.7% |
| 5 | 45 | 1.0% |
| 12 | 45 | 1.0% |
| 7 | 42 | 1.0% |
| 8 | 40 | 0.9% |
| Other values (176) | 640 | 14.7% |
| Value | Count | Frequency (%) |
| 0 | 2778 | 63.6% |
| 1 | 349 | 8.0% |
| 2 | 89 | 2.0% |
| 3 | 173 | 4.0% |
| 4 | 75 | 1.7% |
| 5 | 45 | 1.0% |
| 6 | 90 | 2.1% |
| 7 | 42 | 1.0% |
| 8 | 40 | 0.9% |
| 9 | 37 | 0.8% |
| Value | Count | Frequency (%) |
| 80995 | 1 | |
| 74215 | 1 | |
| 9361 | 1 | |
| 9014 | 1 | |
| 4873 | 1 | |
| 4027 | 1 | |
| 2399 | 1 | |
| 2302 | 1 | |
| 2160 | 1 | |
| 1685 | 1 |
invoices_p
| Distinct | 60 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.241868988 |
| Minimum | 0 |
|---|---|
| Maximum | 209 |
| Zeros | 33 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 5 |
| 95-th percentile | 13 |
| Maximum | 209 |
| Range | 209 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 7.681885185 |
|---|---|
| Coefficient of variation (CV) | 1.810967101 |
| Kurtosis | 249.7371905 |
| Mean | 4.241868988 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 12.0756929 |
| Sum | 18520 |
| Variance | 59.01135999 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1493 | |
| 2 | 831 | |
| 3 | 508 | 11.6% |
| 4 | 387 | 8.9% |
| 5 | 242 | 5.5% |
| 6 | 172 | 3.9% |
| 7 | 143 | 3.3% |
| 8 | 98 | 2.2% |
| 9 | 68 | 1.6% |
| 10 | 54 | 1.2% |
| Other values (50) | 370 | 8.5% |
| Value | Count | Frequency (%) |
| 0 | 33 | 0.8% |
| 1 | 1493 | |
| 2 | 831 | |
| 3 | 508 | 11.6% |
| 4 | 387 | 8.9% |
| 5 | 242 | 5.5% |
| 6 | 172 | 3.9% |
| 7 | 143 | 3.3% |
| 8 | 98 | 2.2% |
| 9 | 68 | 1.6% |
| Value | Count | Frequency (%) |
| 209 | 1 | |
| 201 | 1 | |
| 124 | 1 | |
| 97 | 1 | |
| 93 | 1 | |
| 91 | 1 | |
| 86 | 1 | |
| 73 | 1 | |
| 63 | 1 | |
| 62 | 1 |
invoices_d
Real number (ℝ≥0)
HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, ZEROS
| Distinct | 27 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8364635822 |
| Minimum | 0 |
|---|---|
| Maximum | 47 |
| Zeros | 2778 |
| Zeros (%) | 63.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 47 |
| Range | 47 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 2.136425755 |
|---|---|
| Coefficient of variation (CV) | 2.55411688 |
| Kurtosis | 134.1595049 |
| Mean | 0.8364635822 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.846668074 |
| Sum | 3652 |
| Variance | 4.564315005 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2778 | 63.6% |
| 1 | 886 | 20.3% |
| 2 | 308 | 7.1% |
| 3 | 147 | 3.4% |
| 4 | 97 | 2.2% |
| 5 | 44 | 1.0% |
| 6 | 30 | 0.7% |
| 7 | 22 | 0.5% |
| 8 | 9 | 0.2% |
| 9 | 7 | 0.2% |
| Other values (17) | 38 | 0.9% |
| Value | Count | Frequency (%) |
| 0 | 2778 | 63.6% |
| 1 | 886 | 20.3% |
| 2 | 308 | 7.1% |
| 3 | 147 | 3.4% |
| 4 | 97 | 2.2% |
| 5 | 44 | 1.0% |
| 6 | 30 | 0.7% |
| 7 | 22 | 0.5% |
| 8 | 9 | 0.2% |
| 9 | 7 | 0.2% |
| Value | Count | Frequency (%) |
| 47 | 1 | |
| 45 | 1 | |
| 35 | 1 | |
| 31 | 1 | |
| 27 | 1 | |
| 23 | 1 | |
| 21 | 1 | |
| 19 | 2 | |
| 18 | 1 | |
| 17 | 2 |
avg_ticket
| Distinct | 4292 |
|---|---|
| Distinct (%) | 98.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67.89335507 |
| Minimum | 0 |
|---|---|
| Maximum | 77183.6 |
| Zeros | 33 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4.356321839 |
| Q1 | 11.99768757 |
| median | 17.67979201 |
| Q3 | 24.79484848 |
| 95-th percentile | 93.3 |
| Maximum | 77183.6 |
| Range | 77183.6 |
| Interquartile range (IQR) | 12.79716091 |
Descriptive statistics
| Standard deviation | 1463.214147 |
|---|---|
| Coefficient of variation (CV) | 21.55165473 |
| Kurtosis | 2263.833843 |
| Mean | 67.89335507 |
| Median Absolute Deviation (MAD) | 6.449171759 |
| Skewness | 46.64636797 |
| Sum | 296422.3883 |
| Variance | 2140995.64 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 33 | 0.8% |
| 15 | 5 | 0.1% |
| 76.32 | 4 | 0.1% |
| 25.5 | 4 | 0.1% |
| 179 | 4 | 0.1% |
| 18.7 | 3 | 0.1% |
| 358 | 2 | < 0.1% |
| 13.59 | 2 | < 0.1% |
| 15.174 | 2 | < 0.1% |
| 71.4 | 2 | < 0.1% |
| Other values (4282) | 4305 |
| Value | Count | Frequency (%) |
| 0 | 33 | |
| 2.101285714 | 1 | < 0.1% |
| 2.150588235 | 1 | < 0.1% |
| 2.241 | 1 | < 0.1% |
| 2.264375 | 1 | < 0.1% |
| 2.4325 | 1 | < 0.1% |
| 2.462371134 | 1 | < 0.1% |
| 2.504876033 | 1 | < 0.1% |
| 2.50837156 | 1 | < 0.1% |
| 2.54704918 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 77183.6 | 1 | |
| 56157.5 | 1 | |
| 13305.5 | 1 | |
| 4453.43 | 1 | |
| 3861 | 1 | |
| 3096 | 1 | |
| 2033.1 | 1 | |
| 2027.86 | 1 | |
| 1687.2 | 1 | |
| 1377.077778 | 1 |
avg_recency_days
Real number (ℝ≥0)
HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION
| Distinct | 1014 |
|---|---|
| Distinct (%) | 23.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.82100731 |
| Minimum | 0 |
|---|---|
| Maximum | 373 |
| Zeros | 38 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.25 |
| Q1 | 3.75 |
| median | 19.25 |
| Q3 | 78 |
| 95-th percentile | 298.75 |
| Maximum | 373 |
| Range | 373 |
| Interquartile range (IQR) | 74.25 |
Descriptive statistics
| Standard deviation | 95.0274444 |
|---|---|
| Coefficient of variation (CV) | 1.46599765 |
| Kurtosis | 2.196536983 |
| Mean | 64.82100731 |
| Median Absolute Deviation (MAD) | 18.25 |
| Skewness | 1.794820946 |
| Sum | 283008.5179 |
| Variance | 9030.215189 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 61 | 1.4% |
| 2 | 52 | 1.2% |
| 4 | 42 | 1.0% |
| 3 | 40 | 0.9% |
| 0 | 38 | 0.9% |
| 0.5 | 35 | 0.8% |
| 5 | 33 | 0.8% |
| 25 | 32 | 0.7% |
| 8 | 31 | 0.7% |
| 7 | 29 | 0.7% |
| Other values (1004) | 3973 |
| Value | Count | Frequency (%) |
| 0 | 38 | |
| 0.006849315068 | 1 | < 0.1% |
| 0.008849557522 | 1 | < 0.1% |
| 0.01123595506 | 1 | < 0.1% |
| 0.01818181818 | 1 | < 0.1% |
| 0.01886792453 | 1 | < 0.1% |
| 0.02127659574 | 1 | < 0.1% |
| 0.02173913043 | 1 | < 0.1% |
| 0.02409638554 | 1 | < 0.1% |
| 0.025 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 373 | 15 | |
| 372 | 17 | |
| 371 | 7 | |
| 369 | 3 | 0.1% |
| 368 | 5 | 0.1% |
| 367 | 5 | 0.1% |
| 366 | 8 | |
| 365 | 10 | |
| 364 | 5 | 0.1% |
| 362 | 6 | 0.1% |
avg_basket_size
Real number (ℝ≥0)
HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, SKEWED
| Distinct | 2136 |
|---|---|
| Distinct (%) | 48.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 251.6187465 |
| Minimum | 0 |
|---|---|
| Maximum | 74215 |
| Zeros | 33 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 30.54166667 |
| Q1 | 91 |
| median | 160.5666667 |
| Q3 | 270.25 |
| 95-th percentile | 598.6 |
| Maximum | 74215 |
| Range | 74215 |
| Interquartile range (IQR) | 179.25 |
Descriptive statistics
| Standard deviation | 1308.868296 |
|---|---|
| Coefficient of variation (CV) | 5.201791654 |
| Kurtosis | 2541.880608 |
| Mean | 251.6187465 |
| Median Absolute Deviation (MAD) | 81.65 |
| Skewness | 48.13011902 |
| Sum | 1098567.447 |
| Variance | 1713136.215 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 33 | 0.8% |
| 100 | 19 | 0.4% |
| 82 | 18 | 0.4% |
| 136 | 17 | 0.4% |
| 120 | 17 | 0.4% |
| 88 | 17 | 0.4% |
| 72 | 17 | 0.4% |
| 106 | 16 | 0.4% |
| 73 | 16 | 0.4% |
| 78 | 16 | 0.4% |
| Other values (2126) | 4180 |
| Value | Count | Frequency (%) |
| 0 | 33 | |
| 1 | 6 | 0.1% |
| 1.5 | 1 | < 0.1% |
| 2 | 5 | 0.1% |
| 3 | 2 | < 0.1% |
| 3.333333333 | 1 | < 0.1% |
| 4 | 8 | 0.2% |
| 5 | 3 | 0.1% |
| 5.333333333 | 1 | < 0.1% |
| 5.666666667 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 74215 | 1 | |
| 40498.5 | 1 | |
| 7824 | 1 | |
| 6009.333333 | 1 | |
| 4300 | 1 | |
| 4280 | 1 | |
| 3684.47619 | 1 | |
| 3028 | 1 | |
| 2924 | 1 | |
| 2880 | 1 |
avg_variety
| Distinct | 1043 |
|---|---|
| Distinct (%) | 23.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.0160071 |
| Minimum | 0 |
|---|---|
| Maximum | 300.6470588 |
| Zeros | 33 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.229166667 |
| Q1 | 9.186363636 |
| median | 16.87301587 |
| Q3 | 28 |
| 95-th percentile | 59 |
| Maximum | 300.6470588 |
| Range | 300.6470588 |
| Interquartile range (IQR) | 18.81363636 |
Descriptive statistics
| Standard deviation | 20.31141942 |
|---|---|
| Coefficient of variation (CV) | 0.9225750757 |
| Kurtosis | 18.74952384 |
| Mean | 22.0160071 |
| Median Absolute Deviation (MAD) | 8.873015873 |
| Skewness | 3.031567868 |
| Sum | 96121.88702 |
| Variance | 412.5537589 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 99 | 2.3% |
| 13 | 90 | 2.1% |
| 14 | 84 | 1.9% |
| 10 | 82 | 1.9% |
| 11 | 81 | 1.9% |
| 9 | 78 | 1.8% |
| 6 | 75 | 1.7% |
| 7 | 74 | 1.7% |
| 5 | 71 | 1.6% |
| 8 | 69 | 1.6% |
| Other values (1033) | 3563 |
| Value | Count | Frequency (%) |
| 0 | 33 | 0.8% |
| 1 | 99 | |
| 1.2 | 1 | < 0.1% |
| 1.25 | 1 | < 0.1% |
| 1.333333333 | 2 | < 0.1% |
| 1.5 | 9 | 0.2% |
| 1.555555556 | 1 | < 0.1% |
| 1.571428571 | 1 | < 0.1% |
| 1.666666667 | 4 | 0.1% |
| 1.833333333 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 300.6470588 | 1 | |
| 219 | 1 | |
| 203.5 | 1 | |
| 191 | 1 | |
| 171 | 1 | |
| 164 | 1 | |
| 158 | 1 | |
| 157 | 1 | |
| 153 | 1 | |
| 149 | 1 |
purchases_pday
| Distinct | 1243 |
|---|---|
| Distinct (%) | 28.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4005636754 |
| Minimum | 0 |
|---|---|
| Maximum | 17 |
| Zeros | 33 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.009592344556 |
| Q1 | 0.01989485241 |
| median | 0.04474578328 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 0.9801051476 |
Descriptive statistics
| Standard deviation | 0.5606445071 |
|---|---|
| Coefficient of variation (CV) | 1.399638913 |
| Kurtosis | 175.8260771 |
| Mean | 0.4005636754 |
| Median Absolute Deviation (MAD) | 0.03328437113 |
| Skewness | 6.66821218 |
| Sum | 1748.861007 |
| Variance | 0.3143222633 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1502 | |
| 2 | 50 | 1.1% |
| 0 | 33 | 0.8% |
| 0.0625 | 18 | 0.4% |
| 0.02777777778 | 17 | 0.4% |
| 0.02380952381 | 17 | 0.4% |
| 0.09090909091 | 15 | 0.3% |
| 0.08333333333 | 14 | 0.3% |
| 0.02127659574 | 13 | 0.3% |
| 0.03448275862 | 13 | 0.3% |
| Other values (1233) | 2674 |
| Value | Count | Frequency (%) |
| 0 | 33 | |
| 0.005449591281 | 1 | < 0.1% |
| 0.005464480874 | 1 | < 0.1% |
| 0.005479452055 | 1 | < 0.1% |
| 0.005494505495 | 1 | < 0.1% |
| 0.005586592179 | 2 | < 0.1% |
| 0.005602240896 | 1 | < 0.1% |
| 0.005617977528 | 2 | < 0.1% |
| 0.00566572238 | 1 | < 0.1% |
| 0.005681818182 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 17 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 3 | 4 | 0.1% |
| 2 | 50 | 1.1% |
| 1.142857143 | 1 | < 0.1% |
| 1 | 1502 | |
| 0.75 | 1 | < 0.1% |
| 0.6666666667 | 4 | 0.1% |
| 0.5588235294 | 1 | < 0.1% |
| 0.5388739946 | 1 | < 0.1% |
gross_revenue
Real number (ℝ)
HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, SKEWED
| Distinct | 4282 |
|---|---|
| Distinct (%) | 98.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1900.16358 |
| Minimum | -4287.63 |
|---|---|
| Maximum | 279489.02 |
| Zeros | 11 |
| Zeros (%) | 0.3% |
| Negative | 41 |
| Negative (%) | 0.9% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -4287.63 |
|---|---|
| 5-th percentile | 101.175 |
| Q1 | 293.1875 |
| median | 648.55 |
| Q3 | 1612.625 |
| 95-th percentile | 5632.72 |
| Maximum | 279489.02 |
| Range | 283776.65 |
| Interquartile range (IQR) | 1319.4375 |
Descriptive statistics
| Standard deviation | 8224.856057 |
|---|---|
| Coefficient of variation (CV) | 4.328498948 |
| Kurtosis | 606.3365532 |
| Mean | 1900.16358 |
| Median Absolute Deviation (MAD) | 455.265 |
| Skewness | 21.69086914 |
| Sum | 8296114.19 |
| Variance | 67648257.16 |
| Monotonicity | Not monotonic |
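The Sum and Mean rows for gross_revenue line up with two earlier blocks: the devolutions block and the block that precedes it, which by column order corresponds to purchases. The check below uses only figures printed in the report. It suggests, without the report stating it, that gross_revenue was built as purchases minus devolutions.

```python
from decimal import Decimal

# Sums and means copied from the respective "Descriptive statistics" tables.
purchases_sum = Decimal("8907440.58")   # block preceding devolutions
devolutions_sum = Decimal("611326.39")
gross_revenue_sum = Decimal("8296114.19")

purchases_mean = Decimal("2040.183367")
devolutions_mean = Decimal("140.019787")
gross_revenue_mean = Decimal("1900.16358")

print(purchases_sum - devolutions_sum == gross_revenue_sum)
print(purchases_mean - devolutions_mean == gross_revenue_mean)
```

Matching totals are necessary but not sufficient evidence for a row-level identity, so this should be read as a consistency check rather than proof of how the column was derived.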
| Value | Count | Frequency (%) |
| 0 | 11 | 0.3% |
| 76.32 | 4 | 0.1% |
| 15 | 3 | 0.1% |
| 440 | 3 | 0.1% |
| 35.4 | 3 | 0.1% |
| 363.65 | 3 | 0.1% |
| 321.05 | 2 | < 0.1% |
| 181.09 | 2 | < 0.1% |
| 179 | 2 | < 0.1% |
| 73.5 | 2 | < 0.1% |
| Other values (4272) | 4331 |
| Value | Count | Frequency (%) |
| -4287.63 | 1 | |
| -1592.49 | 1 | |
| -1192.2 | 1 | |
| -1165.3 | 1 | |
| -1126 | 1 | |
| -840.76 | 1 | |
| -611.86 | 1 | |
| -451.42 | 1 | |
| -295.09 | 1 | |
| -227.44 | 1 |
| Value | Count | Frequency (%) |
| 279489.02 | 1 | |
| 256438.49 | 1 | |
| 187482.17 | 1 | |
| 132572.62 | 1 | |
| 123725.45 | 1 | |
| 113384.14 | 1 | |
| 88125.38 | 1 | |
| 65892.08 | 1 | |
| 62653.1 | 1 | |
| 59419.34 | 1 |
relative_revenue
Real number (ℝ)
HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION, HIGH CORRELATION
| Distinct | 1547 |
|---|---|
| Distinct (%) | 35.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9464585758 |
| Minimum | -1 |
| Maximum | 1 |
| Zeros | 11 |
| Zeros (%) | 0.3% |
| Negative | 41 |
| Negative (%) | 0.9% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 0.7752317671 |
| Q1 | 0.9773601379 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 2 |
| Interquartile range (IQR) | 0.02263986212 |
Descriptive statistics
| Standard deviation | 0.2102841969 |
|---|---|
| Coefficient of variation (CV) | 0.2221800322 |
| Kurtosis | 58.90281829 |
| Mean | 0.9464585758 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -7.214265949 |
| Sum | 4132.238142 |
| Variance | 0.04421944345 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2778 | 63.6% |
| -1 | 33 | 0.8% |
| 0 | 11 | 0.3% |
| 0.9288798578 | 1 | < 0.1% |
| 0.9765744129 | 1 | < 0.1% |
| 0.553801307 | 1 | < 0.1% |
| 0.751558053 | 1 | < 0.1% |
| 0.9830745685 | 1 | < 0.1% |
| 0.9730736475 | 1 | < 0.1% |
| 0.7954217051 | 1 | < 0.1% |
| Other values (1537) | 1537 | 35.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -1 | 33 | 0.8% |
| -0.965547338 | 1 | < 0.1% |
| -0.5961406632 | 1 | < 0.1% |
| -0.4064582855 | 1 | < 0.1% |
| -0.3712984055 | 1 | < 0.1% |
| -0.3333333333 | 1 | < 0.1% |
| -0.2561983471 | 1 | < 0.1% |
| -0.1608589951 | 1 | < 0.1% |
| -0.05743520353 | 1 | < 0.1% |
| 0 | 11 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2778 | 63.6% |
| 0.9998361672 | 1 | < 0.1% |
| 0.9996863718 | 1 | < 0.1% |
| 0.9994487783 | 1 | < 0.1% |
| 0.9994162442 | 1 | < 0.1% |
| 0.9993461799 | 1 | < 0.1% |
| 0.9992302394 | 1 | < 0.1% |
| 0.999085358 | 1 | < 0.1% |
| 0.9990001507 | 1 | < 0.1% |
| 0.9986406047 | 1 | < 0.1% |
relative_quantity
Real number (ℝ)
HIGH CORRELATION
| Distinct | 1127 |
|---|---|
| Distinct (%) | 25.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9273829918 |
| Minimum | -1 |
| Maximum | 1 |
| Zeros | 18 |
| Zeros (%) | 0.4% |
| Negative | 47 |
| Negative (%) | 1.1% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 0.6008860448 |
| Q1 | 0.9646793976 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 2 |
| Interquartile range (IQR) | 0.03532060239 |
Descriptive statistics
| Standard deviation | 0.2302652918 |
|---|---|
| Coefficient of variation (CV) | 0.2482957892 |
| Kurtosis | 40.57015379 |
| Mean | 0.9273829918 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -5.825843262 |
| Sum | 4048.954142 |
| Variance | 0.05302210461 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2778 | 63.6% |
| -1 | 33 | 0.8% |
| 0 | 18 | 0.4% |
| 0.6 | 9 | 0.2% |
| 0.9830508475 | 8 | 0.2% |
| 0.8947368421 | 7 | 0.2% |
| 0.9512195122 | 7 | 0.2% |
| 0.9718309859 | 7 | 0.2% |
| 0.9672131148 | 7 | 0.2% |
| 0.9574468085 | 6 | 0.1% |
| Other values (1117) | 1486 | 34.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -1 | 33 | 0.8% |
| -0.9895287958 | 1 | < 0.1% |
| -0.9874213836 | 1 | < 0.1% |
| -0.503649635 | 1 | < 0.1% |
| -0.2978723404 | 1 | < 0.1% |
| -0.2546374368 | 1 | < 0.1% |
| -0.1823176508 | 1 | < 0.1% |
| -0.1363636364 | 1 | < 0.1% |
| -0.1194029851 | 1 | < 0.1% |
| -0.1180124224 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2778 | 63.6% |
| 0.9998446361 | 1 | < 0.1% |
| 0.9993894994 | 1 | < 0.1% |
| 0.9993149121 | 1 | < 0.1% |
| 0.9992890153 | 1 | < 0.1% |
| 0.9990356798 | 1 | < 0.1% |
| 0.9987129987 | 1 | < 0.1% |
| 0.9986666667 | 1 | < 0.1% |
| 0.9986130374 | 1 | < 0.1% |
| 0.9985304923 | 1 | < 0.1% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
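As a minimal sketch of that recipe (not the report generator's own implementation), the pure-Python example below ranks both variables, averaging the ranks of tied values, and then applies the covariance-over-standard-deviations formula to the ranks. The function names `average_ranks`, `pearson` and `spearman` are chosen here for illustration.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but nonlinear in x

print("Pearson r :", pearson(x, y))
print("Spearman ρ:", spearman(x, y))
```

Because y increases whenever x does, the ranks agree exactly and ρ comes out at 1 (up to floating-point rounding), while r stays below 1 because the relationship is not linear.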
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for an exactly linear relationship the steepness of the line (its angle to the x-axis) does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
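The invariance property can be checked numerically. The sketch below, written for illustration with made-up sample data, computes r from the definition and then recomputes it after shifting and rescaling each variable by a positive factor. Negating one variable flips the sign of r.

```python
from statistics import mean


def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


# Illustrative data, not taken from the report.
x = [1.0, 2.0, 4.0, 7.0, 11.0]
y = [2.0, 1.0, 5.0, 6.0, 12.0]

r = pearson(x, y)
# Change location and scale of each variable separately (positive scale factors).
r_rescaled = pearson([3 * a + 10 for a in x], [0.5 * b - 4 for b in y])
# A negative scale factor reverses the direction of the association.
r_flipped = pearson(x, [-b for b in y])

print(r, r_rescaled, r_flipped)
```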
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
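The pair-counting definition translates directly into code. The sketch below implements exactly the formula as stated, which corresponds to the variant usually called tau-a; the function name `kendall_tau_a` is chosen here for illustration. Library routines may differ: SciPy's `kendalltau`, for example, defaults to tau-b, which adjusts the denominator for ties, so results can diverge on data with many repeated values.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, as described above."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:       # both variables move in the same direction
            concordant += 1
        elif sign < 0:     # the variables move in opposite directions
            discordant += 1
        # sign == 0 means a tie in x or y; it counts toward neither total
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs


x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one swapped pair out of six

print(kendall_tau_a(x, y))  # 5 concordant, 1 discordant: (5 - 1) / 6
```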
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
First rows
| | df_index | customer_id | purchases | devolutions | recency_p | recency_d | quantity_p | quantity_d | invoices_p | invoices_d | avg_ticket | avg_recency_days | avg_basket_size | avg_variety | purchases_pday | gross_revenue | relative_revenue | relative_quantity |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 17850 | 5.391210e+03 | 1.025800e+02 | 3.720000e+02 | 3.020000e+02 | 3.500000e+01 | 2.100000e+01 | 3.400000e+01 | 1.000000e+00 | 1.815222e+01 | 1.006667e+02 | 5.097059e+01 | 8.735294e+00 | 1.700000e+01 | 5.288630e+03 | 9.626560e-01 | 2.500000e-01 |
| 1 | 1 | 13047 | 3.237540e+03 | 1.584400e+02 | 3.100000e+01 | 3.100000e+01 | 1.320000e+02 | 6.000000e+00 | 1.000000e+01 | 8.000000e+00 | 1.882291e+01 | 2.214286e+00 | 1.391000e+02 | 1.720000e+01 | 2.915452e-02 | 3.079100e+03 | 9.066897e-01 | 9.130435e-01 |
| 2 | 2 | 12583 | 7.281380e+03 | 9.404000e+01 | 2.000000e+00 | 5.600000e+01 | 1.569000e+03 | 5.000000e+01 | 1.500000e+01 | 3.000000e+00 | 2.947927e+01 | 1.111111e-01 | 3.373333e+02 | 1.646667e+01 | 4.032258e-02 | 7.187340e+03 | 9.744991e-01 | 9.382335e-01 |
| 3 | 3 | 13748 | 9.482500e+02 | 0.000000e+00 | 9.500000e+01 | 3.650000e+02 | 1.690000e+02 | -0.000000e+00 | 5.000000e+00 | 0.000000e+00 | 3.386607e+01 | 2.375000e+01 | 8.780000e+01 | 5.600000e+00 | 1.792115e-02 | 9.482500e+02 | 1.000000e+00 | 1.000000e+00 |
| 4 | 4 | 15100 | 8.760000e+02 | 2.409000e+02 | 3.330000e+02 | 3.300000e+02 | 4.800000e+01 | 2.200000e+01 | 3.000000e+00 | 3.000000e+00 | 2.920000e+02 | 5.500000e+01 | 2.666667e+01 | 1.000000e+00 | 7.317073e-02 | 6.351000e+02 | 5.686275e-01 | 3.714286e-01 |
| 5 | 5 | 15291 | 4.668300e+03 | 7.179000e+01 | 2.500000e+01 | 1.720000e+02 | 5.080000e+02 | 2.700000e+01 | 1.500000e+01 | 5.000000e+00 | 4.532330e+01 | 1.470588e+00 | 1.402000e+02 | 6.866667e+00 | 4.297994e-02 | 4.596510e+03 | 9.697094e-01 | 8.990654e-01 |
| 6 | 6 | 14688 | 5.630870e+03 | 5.234900e+02 | 7.000000e+00 | 7.000000e+00 | 5.790000e+02 | 2.810000e+02 | 2.100000e+01 | 6.000000e+00 | 1.721979e+01 | 3.333333e-01 | 1.724286e+02 | 1.557143e+01 | 5.722071e-02 | 5.107380e+03 | 8.298800e-01 | 3.465116e-01 |
| 7 | 7 | 17809 | 5.411910e+03 | 7.842900e+02 | 1.600000e+01 | 1.600000e+01 | 9.610000e+02 | 4.100000e+01 | 1.200000e+01 | 3.000000e+00 | 8.871984e+01 | 1.333333e+00 | 1.714167e+02 | 5.083333e+00 | 3.351955e-02 | 4.627620e+03 | 7.468481e-01 | 9.181637e-01 |
| 8 | 8 | 15311 | 6.076790e+04 | 1.348560e+03 | 0.000000e+00 | 0.000000e+00 | 2.167000e+03 | 2.310000e+02 | 9.100000e+01 | 2.700000e+01 | 2.554346e+01 | 0.000000e+00 | 4.197143e+02 | 2.614286e+01 | 2.433155e-01 | 5.941934e+04 | 9.565796e-01 | 8.073394e-01 |
| 9 | 9 | 14527 | 8.508820e+03 | 7.974400e+02 | 2.000000e+00 | 8.000000e+00 | 1.980000e+02 | 3.000000e+00 | 5.500000e+01 | 3.100000e+01 | 8.753930e+00 | 3.125000e-02 | 3.798182e+01 | 1.767273e+01 | 1.494565e-01 | 7.711380e+03 | 8.286229e-01 | 9.701493e-01 |
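The report does not define relative_revenue or relative_quantity. Judging only from the rows above, both appear to follow the pattern (bought − returned) / (bought + returned), applied to purchases and devolutions for revenue and to quantity_p and quantity_d for quantity. The Python sketch below checks that assumed formula against rows 0, 1 and 4; the function name `relative_balance` is hypothetical, and the formula is an inference from the data, not a documented definition.

```python
def relative_balance(bought, returned):
    """Assumed formula: (bought - returned) / (bought + returned), in [-1, 1].

    A customer with neither purchases nor devolutions would make the
    denominator zero and would need separate handling.
    """
    return (bought - returned) / (bought + returned)


# (purchases, devolutions, reported relative_revenue) for rows 0, 1 and 4
revenue_rows = [
    (5391.21, 102.58, 0.9626560),
    (3237.54, 158.44, 0.9066897),
    (876.00, 240.90, 0.5686275),
]

# (quantity_p, quantity_d, reported relative_quantity) for rows 0, 1 and 4
quantity_rows = [
    (35, 21, 0.2500000),
    (132, 6, 0.9130435),
    (48, 22, 0.3714286),
]

for bought, returned, reported in revenue_rows + quantity_rows:
    computed = relative_balance(bought, returned)
    print(f"computed={computed:.7f}  reported={reported:.7f}")
```

Under this reading, a value of 1 means the customer never returned anything, and -1 means the customer has only returns on record, which would fit the 2778 and 33 occurrences listed for those values.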
Last rows
| | df_index | customer_id | purchases | devolutions | recency_p | recency_d | quantity_p | quantity_d | invoices_p | invoices_d | avg_ticket | avg_recency_days | avg_basket_size | avg_variety | purchases_pday | gross_revenue | relative_revenue | relative_quantity |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4356 | 5950 | 16000 | 1.239370e+04 | 0.000000e+00 | 2.000000e+00 | 3.650000e+02 | 7.700000e+02 | -0.000000e+00 | 3.000000e+00 | 0.000000e+00 | 1.377078e+03 | 2.000000e+00 | 1.703333e+03 | 3.000000e+00 | 3.000000e+00 | 1.239370e+04 | 1.000000e+00 | 1.000000e+00 |
| 4357 | 5951 | 15195 | 3.861000e+03 | 0.000000e+00 | 2.000000e+00 | 3.650000e+02 | 1.404000e+03 | -0.000000e+00 | 1.000000e+00 | 0.000000e+00 | 3.861000e+03 | 2.000000e+00 | 1.404000e+03 | 1.000000e+00 | 1.000000e+00 | 3.861000e+03 | 1.000000e+00 | 1.000000e+00 |
| 4358 | 5952 | 14087 | 1.944200e+02 | 1.275000e+01 | 2.000000e+00 | 2.000000e+00 | 1.130000e+02 | 1.000000e+00 | 1.000000e+00 | 1.000000e+00 | 2.817681e+00 | 2.000000e+00 | 2.510000e+02 | 6.900000e+01 | 1.000000e+00 | 1.816700e+02 | 8.769127e-01 | 9.824561e-01 |
| 4359 | 5953 | 14204 | 1.610300e+02 | 0.000000e+00 | 2.000000e+00 | 3.650000e+02 | 2.100000e+01 | -0.000000e+00 | 1.000000e+00 | 0.000000e+00 | 3.659773e+00 | 2.000000e+00 | 8.200000e+01 | 4.400000e+01 | 1.000000e+00 | 1.610300e+02 | 1.000000e+00 | 1.000000e+00 |
| 4360 | 5954 | 15471 | 4.694800e+02 | 0.000000e+00 | 2.000000e+00 | 3.650000e+02 | 1.020000e+02 | -0.000000e+00 | 1.000000e+00 | 0.000000e+00 | 6.097143e+00 | 2.000000e+00 | 2.660000e+02 | 7.700000e+01 | 1.000000e+00 | 4.694800e+02 | 1.000000e+00 | 1.000000e+00 |
| 4361 | 5958 | 13436 | 1.968900e+02 | 0.000000e+00 | 1.000000e+00 | 3.650000e+02 | 5.800000e+01 | -0.000000e+00 | 1.000000e+00 | 0.000000e+00 | 1.640750e+01 | 1.000000e+00 | 7.600000e+01 | 1.200000e+01 | 1.000000e+00 | 1.968900e+02 | 1.000000e+00 | 1.000000e+00 |
| 4362 | 5960 | 15520 | 3.435000e+02 | 0.000000e+00 | 1.000000e+00 | 3.650000e+02 | 1.340000e+02 | -0.000000e+00 | 1.000000e+00 | 0.000000e+00 | 1.908333e+01 | 1.000000e+00 | 3.140000e+02 | 1.800000e+01 | 1.000000e+00 | 3.435000e+02 | 1.000000e+00 | 1.000000e+00 |
| 4363 | 5962 | 13298 | 3.600000e+02 | 0.000000e+00 | 1.000000e+00 | 3.650000e+02 | 9.600000e+01 | -0.000000e+00 | 1.000000e+00 | 0.000000e+00 | 1.800000e+02 | 1.000000e+00 | 9.600000e+01 | 2.000000e+00 | 1.000000e+00 | 3.600000e+02 | 1.000000e+00 | 1.000000e+00 |
| 4364 | 5963 | 14569 | 2.273900e+02 | 0.000000e+00 | 1.000000e+00 | 3.650000e+02 | 7.000000e+01 | -0.000000e+00 | 1.000000e+00 | 0.000000e+00 | 1.894917e+01 | 1.000000e+00 | 7.900000e+01 | 1.200000e+01 | 1.000000e+00 | 2.273900e+02 | 1.000000e+00 | 1.000000e+00 |
| 4365 | 5970 | 12713 | 8.485500e+02 | 0.000000e+00 | 0.000000e+00 | 3.650000e+02 | 1.010000e+02 | -0.000000e+00 | 1.000000e+00 | 0.000000e+00 | 2.233026e+01 | 0.000000e+00 | 5.080000e+02 | 3.800000e+01 | 1.000000e+00 | 8.485500e+02 | 1.000000e+00 | 1.000000e+00 |